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BitSim: An Algebraic Similarity Measure for Description Logics Concepts 


Description Logics, Semantic Similarity 


Abstract 

In this paper, we propose an algebraic similarity 
measure oss stands for BitSim) for assigning 
semantic similarity score to concept definitions in 
- an expressive fragment of Description 
Logics (DL). We define an algebraic interpretation 
function, Jg, that maps a concept-definition to a 
unique string {oj^) (called bit-code) over an alpha¬ 
bet 2s of 11 symbols belonging to Xs - the lan¬ 
guage over 2s- ^s has semantic correspondence 
with conventional model-theoretic interpretation of 
DL. We then define on Xs- A detailed analysis 
of and o-^s has been given. 


1 Introduction 

Semantic similarity measure serves as the foundation of 
knowledge discovery and management processes such as on¬ 
tology match ing, ontology alignment & m apping, ontology 
merging, etc |Shvaiko and Euzenat, 201 3| . Ontological con¬ 
cept similarity can be based on different approaches: (i) 
string matching of c oncept labels (i.e. lexical similarity) 
IStoilos et al, 2Q05| , (ii) external lexical resou rce/ontology 
based m atching (i.e. lexica-semantic similarity) | [Rada et 
1989a| , (i ii) graph-based matchi ng using lexicons such as 


WordNet l |Stuckenschmidt, 2007 1 (i.e. structural si milarity), 
(iv) property analysis (as in FCA-based similarity |Cimiano 


et al, 200^) o r instance analysis (as in Jaccard similarity 
l |Jaccard, 1998| ) based matching over a large sample of con¬ 
cept instance occurrences (i.e. instance-driven similarity), 
(v) matching based on statistical analysis of attribute-value 
or distribution analysis within fixed context-windows of con¬ 
cepts over lar ge corpora (i.e. statistical similarity) I Li and 


Clifton, 1994| , and (vi) model-theoretic matching of formal 
concept descrip tions (i.e. formal semantic similarity) |Alsub 
ait et al, 2014| . 


It can be argued that, in comparison to other approaches, 
formal semantic similarity measure modeling has not re¬ 
ceived equal research attention. Nevertheless, existing litera¬ 
ture is significant, and can be broadly clas sified into two ap¬ 
proaches: (i) Propositional Logics bas ed |Nienhuys-Cheng, 
1998[ Ramon and Bruynooghe, 1998|, and ( ii) Description 


Logics (DL) basecTlAlsubait et al, 2014\ 


__ Lehmann and 

ITurhan, 2012[ [Stuckenschmidt, 2007i [Fanizzi and dAmato, 


2006[ Borgida et al, 20d5| . The former requires: (a) rep¬ 
resentation of ontologies (mostly in RDFS/OWL format) in 
First Order Predicate Logic, (b) a set of axioms (or domain 
knowledge, mostly as upper ontologies/thesaurus), and (c) a 
SAT solver that checks satisfiability (and hence, satisfiabil¬ 
ity) of disjointness of concept pairs. The latter approach, on 
the other hand, does not necessarily require any formal lan¬ 
guage transformation or satisfiability checker. In this paper 
we propose an algebraic similarity measure, called BitSim 
that can compute semantic similarity of pair of con¬ 
cepts defined in ^XC7 T-fQ The motivation behind o-^s is 
to formulate a formal semantic similarity measure that pro¬ 
vides: (i) a platform for fast, scalable, and accurate semantic 
similarity computation of DL concepts, and (ii) a sound and 
complete correspondence with conventional semantic inter¬ 
pretation of DL. is algebraic, in the sense that it maps a 
given pair of concept codes (called bit-code), instead of con¬ 
cept DL definitions/axioms, to a positive real space. For this 
we define a novel algebraic interpretation function, called Jg, 
that maps an ^XCXf+ definition to a unique string, called 
bit-code, {oj^) belonging to the language Xs defined over a 
novel algebraic alphabet 2s- We prove that Jg has com¬ 
plete correspondence with We also show that cr^s 

is highly adaptive to any kind of similarity measure that re¬ 
lies on set operation. As an example, we have shown how 
cr^s can be plugged into Jaccard similarity index. The con¬ 
tribution of the paper is as follows: 

• : A novel algebraic semantic interpretation function 

for P{£CP{+ . 


• Proof of mathematical correspondence of with se¬ 
mantic interpretation of J?lXCXf+ . 

• o-^s • A novel semantic similarity measure based on 

• Comparative analysis of properties of with contem¬ 

porary DL based similarity measures. 


2 Related Work 


DL based similarity measures, as described in the introduc¬ 
tion, can be further su b-divided into: (i) taxonomic analysis, 
(ii) structural analysis i Tongphu and Suntisrivaraporn, 2014[ 
Ontanon and Plaza, 2Q12pjoslyn et al, 2008 1 [Hariri et al^ 
2006| l, (iii) language approximation | |Stuckenschmidt, 2007 [ 


^yiXC‘K+ : PlJlCl-i U {B^lMION^^INT&ns&cTION] 



































































Tserendorj et al, 2008 [ Groot et al, 20051 [Brandt et aL\ 
2002||, and (iii) model-theoretic analysis l|Distd et al, 201 4} 
Alsubait et al, 20l^ [Lehmann and Turhan, 2012[ |Borgida| 
et al, 2005| . The most common approach for DL based 
similarity m easure modeling adopts taxonomic analysis as 
proposed in [Rada et al, 1989b[ [Resnik and others, 1999 


Jiang and Conrath, 1997[ Wu and Palmer, 1994 [Lin, 1998] !. 

These techniques can be further sub-divided, a s mentioned 
in intro duction, into graph-traversal approac hes iRada et al , 


1989b| an d Information-Content approaches I Resnik and oth 


ers, 1999| . However, these methods can work on a general 


ized ontology and hence, are not sensitive to DL definitions. 


In structural analysis based approaches, a similarity mea¬ 
sure is designed to capture the semantic equivalence of de¬ 
scription trees of definitions of DL concept pairs. One way 
of achieving this is to calculate the de gree of homomorphism 
between such tr ees, as proposed in [ [Tongphu and Suntisri 


varaporn, 2014| . A refinement graph based anti-unification 


approach has been proposed in jOntanon and Plaza, 2012| 
for computing instance similarity. Approximation based tech¬ 
niques aim at converting given DL expression to another 


lower expressive DL language on approximation. In I Stuck- 


enschmidt, 20071 an upper and lower approximation interpre- 
tation for S^Iu have been defined over a sub-vocabulary of 
S9iIQ. The sub-vocabulary can be formed by either remov¬ 
ing concepts atoms in a given defini tion or by replacing them 
with structurally simpler concepts I Groot et al ., 2005| . An¬ 
other technique, as proposed in |Noia et al, 20Q4| ,ls based 
on converting user query into a DL expression and try to clas¬ 
sify the match to be either an exact match, or a potential 
match (i.e., match might happen if some concept atom and 
operators are added) or a partial match (where the user query 
and answer/description found in the knowledge base are in 
conflict). 


One of the pioneer work on model-theoretic interpreta¬ 
tion based similarity approach can be found in |Borgida ~t 
al, 2005 1 . The work shows the inherent difficulty in mea¬ 
suring similarity of DL concepts using conventional taxo¬ 
nomic analysis based techniques. It then uses an Information- 
Content based approach to evaluate the similarity o f two con¬ 
cept definitions. A work has been proposed by |Lehmann 
and Turhan, 2012| for similarity computation of concepts de¬ 
fined in S£fH. In this work, a Jaccard Index I jJaccard, 1998| 
based approach has been followed that compares common 
parents of a concept pair using di fuzzy connector (i.e. a sim¬ 
ilarity score aggregation function). A simila r Jaccard Index 
based approach has been recently proposed in [ [Alsubait et al, 
20141. Anot her recent approach has been proposed in | |Distel 
et al, 201^ . The work emphasizes the necessity of triangle 
inequality property of formal semantic similarity measure. It 
defines two versions of a relaxation function for computing 
dissimilarity of concepts defined in However, it can be 
proved that triangle inequality is not always valid and hence, 
is not a necessary condition. 


3 Preliminaries 

3.1 Jl£C9-(+ - A Description Logics Fragment 

We hereby define the semantic interpretation of (an 

extension of which also interprets current OWL 2 specifica¬ 
tion!^ Let I be an interpretation function, and A be the uni¬ 
versal domain. ^ is defined as: 

• IRJL (Attributive Language): 

- Atomic concept: A^ c A^ 

- Role: r^ Q Af X Af 

- Atomic Negation: = A^ \A^ 

- Top Concept: T = A^ 

- Bottom Concept: J_ = 0^ 

- Conjunction: (C n D)^ = C^ DD^ 

- Value Restriction: (ffR.CY = [a \ Vb.(a, b) e R^ 
^beC^} 

- Limited Existential Restriction: (3R.T)^ = [a \ 
3b.(a,b)eR^ Lb eA^ } 

• 6 (Full Existential Restriction): (3R.C)^ = {a \ 
3b.(a,b)eR^ Abe C^ } 

• C (Concept Negation): -^C^ - A^ \C^ 

• Li (Role Hierarchy): RY Q Ry^ 

• i^ytNiON (Role Union): RY U Ry^ 

• HiNT&RSSCTiON (Role Intersection): RY H R 2 ^ 

3.2 Formal Similarity Measure 

In this section we define the alge braic properties of cr as given 
in [Lehmann and Turhan, 201^ . Let C is a DL concept in a 
given terminology (T-Box) 7“. 

Definition 1: A semantic similarity measure cr is a function 
defined as follows: 
cr : C X C [0,1] where C eT 

Properties of Similarity Measure: Arguabl}0 cr should 
hold the following properties: 

Positiveness : VQ, Cj e T, cr(Ci, Cj) >0 (1) 

Refiexive : VC,- e T, (r(Ci, C,) = 1 (2) 

Maximality : VC/, Cj, Ck e T, cr(Ci, C/) > cr(Cj, Ck) 

(3) 


Symmetry* : VC/, Cj e T, cr(Ci, Cj) = o-(Cj, C/) (4) 

Equivalence Closure : VC/, Cj e T, cr(Ci, Cj) = 1 

^Ci = Cj 


Equivalence Invariance : VC/, Cj, Ck e T, Ci = Cj 

cr(Ci, Ck) = cr(Cj, Ck) 


( 6 ) 


^The syntax of LiECL{+ follows conventional DL as defined in 
[Baader, 2003 1 

denotes that the property is not universally adopted as neces¬ 
sary condition. Also it cannot be proven to be valid in all types of 
algebraic spaces. 













































































Figure 1: Lattice Structure of Eg 

Structural Dependency : VC^, cl, Ct, Cj eT, 3 

Cl = \~] {Ck n C,); 

k<n 

(7) 

and, C;! = PI {Ck n Cj)\ 

k<n 

then, lim cr{0^,C{) = 1 

X—^OO 

Subsumption Preservation : VC/, Cj, Ck e T, 

Ci E Cj ^Ck ^ cr(Ci, Cj) > cr(Ci, Ck) 

Reverse Subsumption Preservation : VC/, Cj, Ck e T, 

CiQCjQCk cr(Cj,Ck)>cr(Ci,Ck) 

(9) 

Strict Monotonicity : VC/, Cj, Ck ^T,3 
if, VCi e T, Ci Q Cl', Cj QCi CkQCi 
then, cr(Ci, Ck) > cr{Ci, Cj) 
and if, 3C^ ^T,3 Ci Q Cm\CkQ Cm and Cj % Cm 
then, cr(Ci, Ck) > cr(Ci, Ci) 

( 10 ) 

It should be noted that the aforementioned necessary prop¬ 
erties may not be sufficient and hence, detailed theoretical 
analysis has to be done on sufficiency. 

4 Xs* Formal Language for Concept Coding 

In this section we define the formal language Xg on which 
the proposed algebraic interpretation function Jg is defined. 
We first define bit (Eg), the alphabet of Xg , as follows: 

Definition 2: A base alphabet (Eg^^^J is an alphabet de¬ 
fined as: Eg^^^^ = {0,1} where 

• 0 is the empty symbol. It is also called potential bit since 
it generates all other symbols (i.e. bits) 

• 1 is base bit, called property bit, signifying the presence 
of a property at the string position that it holds. 

Definition 3: An bit operator (©g) is a set of operators on 
the base bits defined as: ©g = {Ug, rig, -ig}. 

We now define a very important semantics for potential bit 
(i.e. 0) as follows: 

-sO = 0 (1) 

Definition 4: An derived alphabet (^Sderived) 1^ alphabet 
defined as: Eg,^^.^^, = {X,X",X', Y, Y', V, F,T, _L} where 


• X" = “"gl 

• X = 1 Ug 0 

• X' = X" Ug 0 

• T' = 7 Ug r 

• r = 1 Hg 0 

• 7 = X" Hg 0 

• _L' = X Hg X' 

Based on the above definition the following observations 
can be made (using de Morgan’s law): 

• X = ^g7 

• X' = -gX' 

• T' = -g_L' 

• T = -igJ_ 

A further analysis shows that Eghas a partial ordering <g (as 
shown in figure 1). It is interesting to note that Z?/ rig -iZ?/ V 
_L;VZ?/ eEg. 

Definition 5.1: An bit-alphabet (Eg) is defined as 

^Sbase ^^derived’ 

It is to be noted that ©g satisfies commutativity and double 
negation over Eg. 

Definition 5.2: A quantifier-alphabet (Eg^) is defined as 
Eg^ = {1,0,X, 7'}. The algebraic space of Eg^ is defined as 
below: 

• X = 1 Ug 0 

• r = 1 Hg 0 

• "’sl — 0 

• -gr = X 

Definition 5.3: A role-alphabet (Eg^) is defined as 

= {1,0,X, 7'}. The algebraic space of Eg^ is defined as 
below: 

• X = 1 Ug 0 

• r = 1 Hg 0 

Definition 6.1: A base bit-code (cosba^e) defined as oj^ba^e- 
0* Og b* Og (o||(ogZ7v o ||)|Kog(Z7g.)^o)|| Og z?^)* Og (o||(og ^3 o 
||(o)|KogZ?g.)+o)|| Og bf)* where og is bit concatenation oper¬ 
ator; Z?v, b^ € Eg^; Z?v = 1, Z ?3 = 0; Z?/ € Eg; Z?g. e Eg^. 
Definition 6.2: A derived bit-code (o^Sdenved) defined as 

where o is 

String concatenation operator. 

It can be observed that the definition of 6L)g^^^.^is recursive. 
We leave the explanation and utility of the definition in sec¬ 
tion 5.3. 

Definition 7; Xg is defined as Xs = { U { 

5 Encoding Jl£,C^+ Concept 
5.1 Motivation 

The motivation behind BitSim {cibs ) is to develop a formal, 
efficient, and scalable matchmaking system that can be ap¬ 
plied in DL based knowledge bases. Unlike other DL based 
similarity measures, crgg was designed to satisfy all the nec¬ 
essary properties defined in section 3.2 with special emphasis 
on structural dependency and strict monotonicity. 




Figure 2: Atomic Concept Encoding: Example 


At the same time, cr^s computation is over Xs ^ rather 
than . This significantly improves the computa¬ 

tional speed since essentially becomes a function over 
pairs (such bit-codes can be computed and stored off-line 
in the knowledge base). Since, cr^s computation is performed 
on pairs of bits holding the same position in , therefore we 
can chunk bit-codes in constant sizes and perform similarity 
over concept pairs on parallel computational platforms. This 
gives massive scalability to • Efficient optimization can 
be performed by caching similarity results of bit-code chunks 
that are frequently visited. 

We will also show that, at a bit level, has a partial 
ordering This allows application-oriented assignment 

of similarity scores to bit pairs at the lowest granularity. Also, 
(T^s is highly adaptive to all types of similarity measures that 
have set theoretic operations on DL concepts. 

5.2 Encoding Atomic Concepts 

Before we show that Xs has complete correspondence with 
J^XC7T-r , we first provide the foundational axioms that helps 
us to encode atomic concepts in ^XC7T-r . For that we need 
to define the proposed algebraic interpretation function 
(also called bit-interpretation). 

Definition 8: Bit-interpretation (Jg ) is a function as fol¬ 
lows: 

: C Xs ; C € T^£C9i+ 

We hereby define where At is an arbitrary atomic 

concept in , using the following two axioms: 

Fundamental Axiom of Atomic Concepts 


Axiom of Significant Property Bit 


3h 


A 






0 * 


( 2 ) 


where, k is the ^-th position (in increasing order from right 
to left) in . In the second axiom holds the significant 
property bit. 

We now show the method to encode inclusion axioms on 
atomic concepts using the following two axioms: 


Axiom of Property Bit Inheritance 

VA/, Aj e T^£c^+, 3 Ai QAj, 

€ cjtj 


= 1 


(3) 
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Figure 3: Algebraic Space of b^j 


Axiom of Atomic Bit-Code Uniqueness 

Given, VA/, Aj e T^£C£t+, 3 Aj % A/ ( 4 ) 

3bl‘ G b'l’ €<j'£b[ = \& b{i^\ 

The above four axioms ensure that a simple boolean inter¬ 
section between any pair of and generates the bit- 
code of the least common subsumer (Ics) atomic concept. An 
example encoding instance has been illustrated in figure 2 . 
For atomic roles we can use the same principle of encoding 
with an alphabet that corresponds to • 

5.3 Encoding Derived Concepts 

For encoding derived concepts, we cannot attain complete¬ 
ness using 6 G)s^^^^only. This is because, for a bounded number 
of atomic concepts (say, n) we need a mechanism to encode 
2^” distinct and disjoint concepts, in the worst case. How¬ 
ever, with only 11 bits in we can only encode 11 " distinct 
concepts. It is because of this reason that we need to use 
oj^derived' method is to have nested encoding for bit oper¬ 
ations over certain special bit pairs. These operations do not 
have direct mapping to the algebraic lattice shown in figure 
1. Instead they map to intermediate and discrete compound 
bits (b^j). which can be represented in terms of We define 
/?^as follows: 

Definition 9: A compound bit is defined as: 
boj= {^s<^Sdenved°)^ wfiere the algebra of b^j is defined as 
shown in figure 3. 

For any derived concept C/, we can state the following: 

Axiom of Binary Concept Operation 

Based on axiom 5 we can state that: 

C' tj ■ 

Lemma 1. For any binary operation between and 

C' c • 

the length of both the operands, b^jf and boj^\ must be same; 
where k: k-th position ofb^ in oj^ . 

c c 

Lemma 2. For any binary operation between b^jf and 

C- © c • 

the length of the resultant b^jf ^ has growth = 0(c^), where 
c € { 1 , 2 ,3} and n is length of operand b^j. 

Lemma 3. If3bk ^ oj^ 3 bk = T, then oj^ = T 

Lemma 4. If3bk e oj^ 3 bk = -L, then co^g = ± 

Lemma 5. If'ibk ^ 3 bk = T', then oj^ = T 

Lemma 6. If^bk e oj^ 3 bk = ±', then co^ = ± 






We now postulate the following axioms for operations over 
derived concepts: 


Axiom of Concept Negation 


( 6 ) 


Axiom of Binary Concept Operation 

(Q ® CjY^ = ®s (7) 

= [b^k * where ®s = ; ® e {u, n) 

Axiom of Universal Role Restriction 

(0\ 

{VRj.Cif^ = \\{oeblo)\\\\{os{b^^Jo)\\ os } 


Axiom of Full Existential Role Restriction 

QRj.Cif^ = IK°A^o)|||Kos{Z,J)o)|| og [b^^‘] 


It is to be noted that for the above axioms can be both 
a simple and a compound bit. We now provide a proof for 
showing the mathematical correspondence between (Xs 
and Y- 

c c c c 

Lemma 7. e e k: k-thposition in oj^ 

b^,‘<sbl‘ / \ c (j(p^ j:c'H+ 


Most Similar 


Least Similar 


^11 


A'"A' 



jy 


Y'X^ 

^B 

^B 

yX" 

^B 




^A'O 

^B 

V'O 

^B 

^B 


Proof. Proof can be derived from the lattice structure of Eg 
(see figure 1.) □ 


6 BitSim Similarity Measure 
6.1 Outline 


Following the above lemma we can state that: 

Theorem 1. (Qf c (Cj/ ^ (Ci/^ c 

Theorem 2. (Q ®s Cj/ <=^ (Q ®s ; ®s e {U, n} 


Proof. VZ>^‘ e e ; 

bl ‘' = bl‘ ®s bl^ ^ bl‘ bl’-,bl‘ ' 

= n; else converse over 

Theorem3. (QR.QY (QR.CiY^;Q e {V,3} 



Proof. From axiom 7 and 8, we can show that (QR.CiY^ is 
unique. This is because is unique. In other words, the 
algebra has complete correspondence with (QR.CiY- □ 


In this section we provide a generic definition for BitSim. We 
first define (i-^- similarity at a bit level) as follows: 

Definition 10: x bj \-^ [0,1]; where bi, bj e Eg. 

It is to be noted that bi can be as well. One can see that 
has a total order ^^-^^(see figure 4). In order to compute 
similarity at a bit-code level, we define an aggregation func¬ 
tion called &^s- There are two parameters that should infiu- 
ence the value output of d-^s'- (i) and (ii) code-generativity 
(PQ). We define code-generativity as follows: 

Definition 11: Code-generativity (Tc^) of any of a con¬ 
cept is the total number of distinct and disjoint concepts that 
are covered by the oj^ • 

As an example, rcq(XXl) = 3; (i.e. ll'l. I'll, 111). We 
now define the similarity measure at a bit-code level (we call 
it ^ss) as follows: 

Definition 12: {oss k x (Tcg)k x i-^ [0,1]. 


Theorem 4. V e , oj^ is unique. 


We now postulate the following axioms: 


Proof. Follows from axiom 4 and lemma 1-6. □ 

Theorem 5. Jl£CP(-\- ^ has complete correspondence with 


□ 


Axiom of 0-bit Similarity 

Vk; ctbs (Ok, 0^) is ignored. 


( 10 ) 


Axiom of T - bit Similarity 

G Eg; crBs(y,a) = \ if a = T\ else undefined. 

( 11 ) 


Proof. The proof follows from theorem 1-3. 
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Figure 5: Comparative Analysis of BitSim 

Axiom of _L - bit Similarity 

€ Eg; = 1 if a = else undefined. 

( 12 ) 


same time, various caching techniques can be applied effi¬ 
ciently. In the future we will be exploring these research 
prospects and other possibilities such as probabilistic reason¬ 
ing on Xg , A-Box reasoning, and reasoning over higher ex¬ 
pressive DL. 

8 Conclusion 

In this paper we have proposed BitSim (crgg ) - an algebraic 
similarity measure for concept definitions in . We 

show that (Tgg satisfies all the necessary algebraic properties 
recommended for a formal similarity measure. Being based 
on Jg , iTgg is highly sensitive to standard DL interpretation. 
Furthermore, crgg is highly adaptive to any similarity mea¬ 
sure that uses set theoretic operations. 
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